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Abstract 

The natural forms of the Leibniz rule for the kth derivative of a product and 
of Faa di Bruno's formula for the fcth derivative of a composition involve the dif- 
ferential operator d k /dxi ■ ■ ■ dxk rather than d k /dx k , with no assumptions about 
whether the variables all distinct, or all identical, or partitioned into 

several distinguishable classes of indistinguishable variables. Coefficients appearing 
in forms of these identities in which some variables are indistinguishable are just 
multiplicities of indistinguishable terms (in particular, if all variables are distinct 
then all coefficients are 1). The computation of the multiplicities in this gener- 
alization of Faa di Bruno's formula is a combinatorial enumeration problem that, 
although completely elementary, seems to have been neglected. We apply the results 
to cumulants of probability distributions. 



1 Introduction 

Both the well-known Leibniz rule 



d k , s (k\ d e u d k l v 



\uv 



dx k -f— ' \ £ J dx l dx k e 
and the celebrated formula of Francesco Faa di Bruno 

(where the sum is over all /c-tuples (m 1; . . . , mi-) of non- negative integers satisfying the 
constraint rri\ + 2m 2 + 3m 3 + • • • + km^ = k) are formulas for kth derivatives of functions 
of functions of x. That is what the left sides of these identities share in common. The 
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right sides of both identities are sums whose terms have products of higher derivatives 
with respect to factors. 

All mathematicians know the combinatorial interpretation of the coefficients in the 
Leibniz rule (the number of size-£ subsets of a size-fc set), and all combinatorialists know 
the combinatorial interpretation of the coefficients in Faa di Bruno's formula (the number 
of partitions of a size-/c set into rrij parts of size j, for j — 1, . . . , k). However, the following 
two points appear not to be widely known: 

1. The natural form of these identities involves the differential operator 

Qk 

dx\ ■ ■ ■ dxk 

instead of d k /dx k . In that form, all coefficients on the right sides are 1. 

2. There should be no assumptions about whether the variables all dis- 
tinct, or all identical, or partitioned into several distinguishable classes of indis- 
tinguishable variables. When some variables become indistinguishable, so do some 
of the terms on the right sides of the identities. Indistinguishable terms then get 
collected, so that each constant coefficient of a term on the right side is that term's 
multiplicity. When all of the variables are indistinguishable, then the multiplicities 
are the coefficients in and (0) above. 

We will call the above Point 1 and Point 2. 

Finding the multiplicities in Point 2 applied to (2) is a combinatorial problem that 
may have escaped explicit treatment until the present paper, in which the solution is 
Proposition 4. The problem is that of enumeration of what we will call "collapsing parti- 
tions" . 

As an example of Point 2, if X2 and X3 collapse into two indistinguishable variables 
called x 2 , so that d 3 /(dxi 8x2 dx%) becomes d 3 /(dxi dxi), then the two-term sum 

dy d 2 y dy d 2 y 
dx 2 dxi dx 3 dx 3 dx\ dx 2 

collapses to the term 

2 _dy__ d 2 y 

dx 2 dxi dx 2 

with multiplicity 2. 

The chain rule and the product rule are enough to entail that the coefficients must be 
positive integers. But without Point 1 and Point 2, it is not obvious what, if anything, 
they enumerate. 

Two papers, Constantine and Savits [S] and Leipnik and Reid [H], give an identity ex- 
pressing (d kl+ - +kn /dx kl ■ ■ ■ dx^fiy) linear combination of products of derivatives 
of f(y) with respect to y and of y with respect to the independent variables. But neither 
of those sources mentions that as more and more variables become indistinguishable the 
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identity does not change except in the collection of newly indistinguishable terms. With- 
out that observation, the combinatorial content of the problem is invisible. Leipnik and 
Reid in 0, p. 1, wrote, "Obviously, 'pure' derivatives, such as - G q Z z \' Z2 ^ are easier to deal 

with than mixed derivatives like d fi^ z z •" But from our point of view, it will be maximally 

"mixed" derivatives like d A G/(dzi dz<i 8Z3 dz^) that are the easiest and most basic. 

Proposition 2 of this paper is partially anticipated by Terry Speed in jHj, page 382. 
That paper gives only the special case in which / is the exponential function, in which 
the derivatives appear as coefficients of power series, and is stated in an inconspicuous 
and somewhat tangential way that mixes it so thoroughly with the theory of cumulants 
in probability theory that it can be understood only by understanding what the paper is 
saying about cumulants. Speed wrote: ". . . the general results are most transparent when 
all . . . variables under discussion are taken to be distinct." That remark played a role in 
inspiring this paper. Its influence will be seen not only in our Proposition 1, but also 
in our treatment of product rules-a topic not directly relevant to that of Speed's paper 
and not mentioned there. Speed went on: "The identification of some or all [variables] 
at a later stage merely introduces extra factors, and at times these multiplicities are not 
particularly easy to calculate." The multiplicities are given by our Proposition 4. 

Unlike [H] and [H], Warren Johnson [B] states a version of Faa di Bruno's formula that 
is explicit about the combinatorial meaning of the coefficients. But Johnson treats only 
functions of one variable and gives nothing like Proposition 2 of the present paper. The 
same is true of the "compositional formula" on page 3 of Richard P. Stanley's treatise j!5j . 
Like Speed, Stanley gives a power-series version of the formula. He mentions the name of 
Faa di Bruno only in endnotes. One also finds a very combinatorics-flavored views of Faa 
di Bruno's formula in Other variations on the theme are in ^Uj, |Hj - and [TH] . 

We conclude this paper with the application of the results to cumulants. 



2 Partial derivatives and partitions of sets 



In the identity 



<9 3 _„ _„ ( d 3 y dy d 2 y t dy d 2 y 



8x18x28x3 \dx1dx2dx3 dxi 8x28x3 8x2 8x\8x 

dy 8 2 y dy dy dy 



3 



(3) 



dx 3 dxi 8x2 8x\ 8x2 8x3 

where y is a function of Xi,X2,X3, the terms correspond in an obvious way to the five 
partitions of the set { 1, 2, 3 }. We will see that this holds generally: the partial derivative 
(8 n /8xi ■ ■ ■ 8x n )e y is e y times the sum whose terms correspond in just this way to the 

partitions of the set { 1, . . . , n }. Using the notation (for example) 8 3 y j rX/ e {2 4 9} ® x i t° 

mean 8 3 y / 8x2 8x4 dxg , we can say 

8 n y = ySTY[ dlBly (A) 

dx x ..-dx n e e 2^n u ^ BdXj u 
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where the sum is over all partitions tc of the set { 1, . . . , n} and the product is over all 
of the parts B, or "blocks" as we will call them, of the partition n, and we denote the 
number of members of any set S by \S\. 

If we have f(y) instead of e y , then the orders of the derivatives of / must be mentioned. 
The order of each derivative of / is just the number of blocks in the partition. This is the 
first result that we will prove (in Section |3J): 

Proposition 1. 



gn 



dx\ ■ ■ ■ dx. 



-M 



b Pt LLjeB ux 3 



(5) 



Example 1. 



d 3 



dx\ 8x2 dx; 



d 3 y 



dxi dx 2 dx 3 



+ f"(y) 



1 block; 1st derivative of /. 

d 2 y dy d 2 y 



+ 



+ 



dy d 2 y 



dy 

dx\ 0x2 dxs ' 3x2 dx\ dx% ' dx% dx\ 8x2 



-2 



2 blocks in each partition; 2nd derivative of /. 



+ f"'(y) 



dy dy dy 



dxi dx 2 dx s 



3 blocks; 3rd derivative of /. 

Proposition 2. If some of the XiS become indistinguishable, then so do the corre- 
sponding terms in the sum; nothing else changes. 
This will also be proved in Sectional 

Example 2. Suppose that in Example 1, the two variables X2 and x% become indis- 
tinguishable from each other. Call them both X2- Then we have 



Q3 

dxi dxtJ ^ 



1 block; 1st derivative of /. 



f'(y) 



d 3 y 



dxi dx\ 



2 blocks in each partition; 2nd derivative of /. 



+ f"(v) 



dy d 2 y 
dxi dxi 



+ 2- 



dy d 2 y 



dx 2 dxi dx 2 



T 



The multiplicity 
of this term is 2. 



3 blocks; 3rd derivative of /. 



+ f"(y) 



dy ( dy 



dx\ V dx 



THE ELECTRONIC JOURNAL OF COMBINATORICS 13 (2006), #R1 



4 



The multiplicity mentioned above is how many formerly distinguishable terms get 
collected to form that term. The problem of finding such multiplicities is treated in 
Section HJ Applying the language of that section to Example 2, we would ask: how 
many partitions of the set { 1, 2, 3 } collapse to the partition {2} + {l,2}of the multiset 
{ 1, 2, 2 } when the set { 1, 2, 3 } collapses to the multiset { 1, 2, 2 }, i.e., when the members 
2 and 3 become indistinguishable? The answer in this case is 2. 



3 Proofs of the first two Propositions 

Proof of Proposition 1. This proof relies on this simple standard algorithm for con- 
verting a list of all partitions of { 1, . . . , n } into a list of all partitions of { 1, . . . , n + 1 }: 

1. To each partition of { 1, . . . , n }, add the 1-member-set { n + 1 } as a new block. 
This gives a list of some of the partitions of { 1, . . . , n + 1 }. 

2. To each block of each partition of { 1, . . . , n }, add n + 1 as a new member of the 
block. This gives a list of M additional partitions of { 1, . . . , n + 1 }. 

The union of these two lists clearly contains all partitions of{l,...,n + l}. 
In particular, it works when n = 0, since the empty set has exactly one partition 1 . 
That establishes the basis for a proof by mathematical induction on n. 

Next we use the Proposition in case n to prove the Proposition in case n+1. 



d 



,n+l 



dxi ■ ■ ■ dx n+ i 



f(y) 



E 

7T 







dx n ^ 



/ <W) (y) n 



d \B\, 



UjeB dx 3 



Ben HJ6B 



/ 



(M+i) 



(y) 



dy 



dx 



n+1 



n 

Ben 



dim, 



Y\j£B 9X 3 



d 



dx 



n+1 



n 



Q\B\y 



UjeB dx 3 



E 



/ 



(M+i) 



(y) 



dy 



dx n ^ 



n 

Ben 



U j& B dx 3 



+/ (W) (y)E 



Q\B\+1, 



Ben 



dx n+l Y[ jeB 9x 3 



n 

Cen:C^B 



d \c\, 



Ujec dx 3 



1 Perhaps as a result of studying set theory, I was surprised when I learned that some respectable 
combinatorialists consider such things as this to be mere convention. One of them even said a case could 
be made for setting the number of partitions to when n = 0. By stark contrast, Gian-Carlo Rota wrote 
in |13|. p. 15, that "the kind of mathematical reasoning that physicists find unbearably pedantic" leads 
not only to the conclusion that the elementary symmetric function in no variables is 1, but straight from 
there to the theory of the Euler characteristic, so that "such reasoning does pay off." The only other 
really sexy example I know is from applied statistics: the non-central chi-square distribution with zero 
degrees of freedom, unlike its "central" counterpart, is non-trivial. 
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Inside the last square brackets is a sum of two terms. The first term corresponds to step 1 
in our algorithm: we have added { n + 1 } as a new block to our partition 7rof{l,...,n}, 
getting a partition with \n\ + 1 blocks, of the set { 1, . . . , n + 1 }. The second corresponds 
to step 2 in our algorithm: we have added n + 1 to each block of our partition 7r of 
{ 1, . . . , n }, getting a partition with \ir\ blocks, of the set { 1, . . . , n + 1 }. We now have a 
sum over all partitions of {l,...,n + l}, each partition being represented as a product 
of partial derivatives, each partial derivative representing a block. And for each partition 
there is a factor f^(y), the order of the derivative being the number of blocks of the 
partition. This proves case n + 1, and the proof by induction on n is complete. ■ 

Proof of Proposition 2. Observe that if, in the argument above, we had differenti- 
ated at the (n + l)th step with respect to Xk for some k G { 1, . . . , n }, rather than with 
respect to x n+ i, then nothing would change except that some formerly distinguishable 
terms would become indistinguishable. ■ 

4 Multisets and collapsing partitions 
4.1 Definitions and conventions 

The first two bullet points and the fifth in the definition below are standard but make clear 
which notational conventions we will follow. The third and fourth may be less standard. 

• A multiset is a "set with multiplicities", i.e., positive integers, assigned to each 
member x, thought of as the number of times x occurs as a member. We will 
write { xi, . . . , x±, . . . , x n , . . . , x n }, indicating multiplicities with underbraces, or, in 



simple cases, for example { a, a, a, b, b, c, c, c, c, c }, the multiplicity being the number 
of times the member is named. In particular, we identify every set with a multiset 
in which every multiplicity is 1. 

Perhaps the most widely known example is the multiset of prime factors of a natural 
number: each prime factor has a multiplicity. 

• The size \S\ of a multiset S is the sum of the multiplicities. 

• The sum of multisets is given by term-by-term addition of multiplicities: 





{x 1 ,... 




£i+m 1 



I, 



'n+m, 



■n 



Only when sets are disjoint is their sum the same as their union. 



• A partition of a multiset expresses that multiset as a sum of multisets. 



• A partition of a positive integer expresses that integer as a sum of positive integers. 
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The next proposition is trivial but crucial. 
Proposition 3. 

• The concept of partition of a set is a special case of that of partition of a multiset. 

• If we identify any multiset in which "all members are equal" (i.e., there is just one 
member, whose multiplicity may be any positive integer) with the multiplicity of 
that one member (for example, the multiset {a, a, a} is identified with the number 
3), then the concept of partition of an integer becomes a special case of that of 
partition of a multiset. 



4.2 Collapsing partitions 

If the members 1, 2, 3, 4 of the set { 1, 2, 3, 4, 5, 6, 7, 8 } are made indistinguishable from 
each other and are called "1", and 5 and 6 are made indistinguishable from each other 
and are called "5", then we say that the set {1,2,3,4,5,6,7,8} has "collapsed" to the 
multiset { 1, 1, 1, 1, 5, 5, 7, 8 }. Then we can ask: how many set-partitions collapse to the 
multiset-partition 

{l,l,5} + {l,l,5} + {7,8}? 
It is a simple exercise to find that the answer is 6. Consequently, via the correspondence 

Qk x H hfen 



{ 1, • • • , 1, • • • , n, . . . , n } < — ► — ^ ^-j- = d T (6) 



hi kn 

between multisets and partial differential operators, Proposition 2 entails that the expan- 
sion of the partial derivative 

d 8 

9 { l,l,l,l,5,5,7,8}/(y) = dxidx 2 dx7dx8 f(y) (7) 

contains (among many others) this term: 

6/'" (! /) (8 (lM y) 2 W = erto) {^j ■ W 

(the order of the derivative of / is 3 because that is how many blocks are in this partition). 

An extreme case of "collapsing" is exemplified by the question: How many partitions 
of the set { 1, 2, 3, 4, 5, 6, 7, 8 } collapse to the partition 3 + 3 + 2 of the number 8 when 
all 8 members of the set { 1, 2, 3, 4, 5, 6, 7, 8 } collapse into indistinguishability? Again, a 
simple exercise shows that the answer is 280. In this extreme case where all members of 
the set become indistinguishable and partitions of the multiset become partitions of an 
integer, the answer to the problem of enumeration of collapsing partitions is well known 
to be given by the coefficients in Faa di Bruno's formula - in this case by the coefficient 
of 

f"f,,\ f d3 y\ 2 d2 y 
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in the expansion of (d 8 /dx 8 )f(y). 
In general, we have this result: 
Corollary to Propositions 1 and 2. Let r = { 1, . . . , 1, . . . , n, . . . , n }. Use the 

notation introduced in (JBJ) above. Then 

QfclH hfen 

drf(y) = fcl ^ /(y) = £ M / w (i/)'(^-^'-), 

ra, 1 • • • or*" r~ 

where the sum is over all partitions 7i+r 2 + - ■ ■ of the multiset r, the order of the derivative 
f(*'(y) is the number of terms in the partition T\ + r 2 + • • • , and the multiplicity M is the 
number of partitions of the set { 1, 2, 3, . . . , ki + ■ ■ ■ + k n } that collapse to the partition 
T\ + T2 + • • ■ of the multiset r when the set { 1, 2, 3, . . . , k\ + ■ ■ ■ + k n } collapses to the 
multiset r. 

In order to use it in the next result, we introduce a convention: 

Notational convention. For any multiset a let a!! denote the product of the facto- 
rials of the multiplicities of the members of a. For example, { 1, 1, 1, 1, 2, 2, 2 }!! = 4!3! = 
144. 

The next result will be proved in Section 
Proposition 4. 

Let t = { 1, . . . , 1, . . . , n, . . . , n }. Consider a partition 

T = Ti + ■ ■ ■ + Tj + T 2 + - ■ + 72 -1 

mi rri2 

in which 7i, ... are a// distinct (so that mi,m2, 7713, . . . are multiplicities of yet another 
sort). Denote this by 

t = miTi + m 2 r 2 H . 

Then the number of partitions of the set { 1, 2, 3, . . . , k% + ■ ■ ■ + k n } that collapse to the 
partition miTi+m2T 2 +m 3 T 3 +- ■ ■ of the multiset r when the set { 1, 2, 3, . . . , ki + ■ ■ • + k n } 
collapses to the multiset r is 

ki I • • • k ' 

'• "' (9) 



Ti!! mi T 2 !! m2 T 3 !! m 3 • • ■mi!m 2 !m 3 ! • • • 



4.3 The most extreme case 

In the most extreme case of indistinguishability of independent variables, the operator 
d k /dxi ■ ■ ■ dxk collapses to d k /dx k : 

d k ^ ^ / d \ lBl 



Be- v 7 
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where, again, the sum is over all partitions tt of { 1, . . . , k }. In this sum, two terms are 
indistinguishable whenever two partitions of the set { 1, . . . , k } both collapse to the same 
partition of the integer k when all of the members of { 1, . . . , k } become indistinguishable. 

In this extreme case, the multiplicities are given by the classic formula of Francesco 
Faa di Bruno (J2J). Francesco Faa di Bruno (1825 - 1888) was (in chronological order) a 
military officer, a mathematician, and a priest. He published this formula in [2] and [3] 
and was posthumously beatified by the Pope 2 . 

Alex Craik's Prehistory of Faa di Bruno 's Formula jB] points out that Faa di Bruno 
was anticipated in 1800 by L.F.A. Arbogast; see £Q. 

Although (j2J) is the well-known form of this identity, Warren P. Johnson has writ- 
ten in [S] (bottom of page 231), that (jlOJ) "is really the fundamental form" of Faa di 
Bruno's formula. We propose that the conjunction of our first two Propositions is more 
fundamental. 



4.4 Conservation of Bell numbers 

By now it should be clear that, when the derivative 

d n 



is expanded as a sum in terms of derivatives of f(y) with respect to y and derivatives of y 
with respect to whichever independent variables appear, then the sum of the coefficients 
is always the number B n of partitions of a set of n members. This is called the nth Bell 
number, in honor of Eric Temple Bell. The first several of these are B — 1, B\ — 1, B 2 — 
2, B 3 = 5,B 4 = 15, B 5 = 52, B e = 203, B 7 = 877, B 8 = 4140, .... For an account of these 
numbers, see As more and more of the n independent variables get identified with 
each other, the sum of the multiplicities never changes. 



5 Proof of Proposition 4 

Imagine k\ + ■ ■ ■ + k n Scrabble tiles. On the first k\ of these, the number "1" is written; 
on the next k 2 of them, "2" appears; and so on. These are partitioned into mi copies of 
the multiset t%, m 2 copies of r 2 , and so on. 

Permuting the k\ "l"s or the k 2 "2"s, etc., does not alter the set of mi + • ■ ■ + m n 
"words" . Thus there are k\\ ■ ■ ■ k n \ permutations of the k\ + ■ ■ ■ + k n tiles representing the 
partition 

r = m\T\ + m 2 r 2 H . 

Permuting the identical elements within any block of this partition does not alter which 
partition of r we have, and therefore the product k\\ ■ ■ ■ k n \ gets divided by ri!! mi r 2 \\ m2 
Neither does permuting the m« blocks identical to r^, and therefore it also gets divided by 
mi! • • • m.J. ■ 

2 An anonymous referee suggests that that pontiff may have been influenced more by Faa di Bruno's 
charitable than mathematical work. 
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6 Product rules 



In the identity 

8 3 



dx\ 8x2 dx 



\uv) 



3 

8 3 v du 8 2 v du d 2 v du d 2 v 

U ■ = 7— + -7— ■ -= 7— + -7— ■ -= 7— + 



dxi dx2 dxs dx\ 8x2 dx% 8x2 8x± 8x3 8x3 8x\ 8x2 
8 2 u 8v 8 2 u 8v 8 2 u 8v 8 3 u 

+ TA 77~ ■ -77- + TA 77— ' 77" + 



8x\ 8x2 8x3 dx\ 8x3 8x2 8x2 8x3 dx\ dx\ 8x2 8x3 

the terms correspond in an obvious way to the eight subsets of the set {1,2,3}. This 
exemplifies the first part of our next result. 
Proposition 5. 

8 n _ 8^U d^^V 

8x x --- 8x n ^ n ie5 dx 3 Uj^S dx 3 ' 

where the index S runs through the set of all subsets of { 1, . . . , n }. 

• If some of the variables become indistinguishable, then so do some of the terms in 
the sum; nothing else changes. 



Example 4. 



8 3 8 3 v du 8 2 v du 8 2 v 

luv) = u ■ - — —5- + - — • — + 2 



dxi 8x\ 8x\ 8x\ 8x\ 8x\ 8x2 8x\ 8x2 



^ 8 2 u 8v 8 2 u 8v 8 3 u 
dxi 8x2 8x 2 8x\ 8x\ 8x\ 8x\ 

In this case the solution of the combinatorial problem is simpler: 
Proposition 6. 

8x\ l ■ ■ ■ dxfr ~ , _ n ' " / =n VV Vn) dxi 1 ■ ■ ■ dxj n ' dx k i~ h ■ • • 



In the most extreme case, all of the independent variables become indistinguishable, and 
we have the familiar Leibniz rule The proofs of Propositions 5 and 6 are left as 
exercises. 

As far as the present writer knows, the second part of Proposition 5 is new. It identifies 
the easy combinatorial problem that Proposition 6 solves. Proposition 6 can be found in 
both 5J and 4J, p. 131, but without the combinatorial interpretation of the coefficients. 
The first part of Proposition 5 is an important special case of Proposition 6, and we do 
not know of any earlier explicit statement of it than that in the present paper (a referee 
says "it could be just about anywhere" and I have not succeeded in proving otherwise). 
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7 Cumulants 



The omitted fragment represented by the second ellipsis ". . . " in the first quote from 
Terry Speed in Section [T] is the word random. That is because Speed's topic is that of 
cumulants of random variables. 

For positive integers n, the nth cumulant functional K n assigns a real number K n (X) 
to real-valued random variables X. Let u = E(X) be the expected value of X. Then the 
nth central moment of X is E((X — u) n ). For n > 2, the nth cumulant shares with the 
nth central moment the properties of nth-degree homogeneity and translation-invariance: 

K n (cX) = C n K n (X), 
K n (X + c) = K n (X). 

Moreover, if the random variables X±, . . . , X m are independent, then 

K n (Xi H h X m ) — K n (Xi) H h K n (X m ). 

The nth central moment has this additivity property only when n < 3. In fact, when n = 
either 2 or 3, the cumulant is just the central moment. When n = 1, then the cumulant is 
the expected value. For all n, the nth cumulant is an nth-degree polynomial in the first 
n moments. 

Cumulants were introduced in the 19th century in ^1 by the Danish actuary Thorvald 
Thiele, who called them half-invariants; an English translation was published in 1931; see 
[TT] . They were first publicly given the name cumulants in 1931 by the statisticians Ronald 
Fisher and John Wishart in the name having been suggested to Fisher in private 
correspondence from the statistician Harold Hotelling. There are also joint cumulants 

k(Xi, . . . , X n ). 

When n = 2, this is just the covariance. All probabilists and statisticians know that the 
covariance between a random variable and itself is its variance. A similar thing happens 
with joint cumulants: When the n random variables collapse into indistinguishability, 
then the joint cumulant coincides with the nth cumulant of one random variable: 

K (X 1 _^£) = K n (X). 

n 

Beyond this talk of "collapsing into indistinguishability", a parallel between cumulants 
and the partial derivatives treated in the foregoing sections is seen in the identity that 
expresses the nth raw moment E(X n ) (not the nth central moment) in terms of the first 
n cumulants: 

m n ) = J2U^\( x )- en) 

7r Ben 

For random variables X±, . . . , X n , a similar identity holds: 

E(Xi ■■■X n ) = Y,\{<X l :ieB). (12) 

7r Be-rr 
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The identities and (|12|) completely characterize all of the cumulant functionals. 

That is how Terry Speed came to consider the question of these multiplicities, of which 
all he wrote was that they are not always easy to calculate. Workers with cumulants know 
what to do in the two opposite extreme cases: when none of the random variables are 
identified then all coefficients are equal to 1, and when all are identified then the classic 
Faa di Bruno formula (J2J) gives the coefficients. But if one were to judge by Speed's 
comments, they might seem at something of a loss in cases intermediate between "all" 
and "none" . That case is handled by our Proposition 4. 

Speed's topic and that of partial derivatives are not two disparate applications of our 
Proposition 4. The joint cumulant may be characterized as the coefficient of t\ • ■ ■ t n in 
the power-series expansion of 

logE(exp(t 1 X 1 + --- + t n X n )) 

(at least in the case in which all moments exist). More tersely stated, the joint cumulant- 
generating function is the logarithm of the joint moment-generating function. Since the 
coefficients of a power series of the form £\ « c h,-,irfii ' ' ' ' ' ' ? n0 are its partial 

derivatives at ii = • • • = t n — 0, Speed's topic is a special case of ours. 
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